A hybrid deep-learning approach for complex biochemical named entity recognition
نویسندگان
چکیده
Named entity recognition (NER) of chemicals and drugs is a critical domain information extraction in biochemical research. NER provides support for text mining reactions, including relation extraction, attribute metabolic response relationship extraction. However, the existence complex naming characteristics biomedical field, such as polysemy special characters, make task very challenging. Here, we propose hybrid deep learning approach to improve accuracy NER. Specifically, our applies Bidirectional Encoder Representations from Transformers (BERT) model extract underlying features text, learns representation context through Bi-directional Long Short-Term Memory (BILSTM), incorporates multi-head attention (MHATT) mechanism chapter-level features. In this approach, MHATT aims abbreviations efficiently deal with problem inconsistency full-text labels. Moreover, conditional random field (CRF) used label sequence tags because probabilistic method does not need strict independence assumptions can accommodate arbitrary information. The experimental evaluation on publicly-available dataset shows that proposed achieves best performance; particular, it substantially improves performance recognizing abbreviations, polysemes, low-frequency entities, compared state-of-the-art approaches. For instance, accuracies entities produced by BILSTM-CRF algorithm, those two datasets (MULTIPLE IDENTIFIER) have been increased 80% 21.69%, respectively.
منابع مشابه
Named Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملDeep Active Learning for Named Entity Recognition
Deep neural networks have advanced the state of the art in named entity recognition. However, under typical training procedures, advantages over classical methods emerge only with large datasets. As a result, deep learning is employed only when large public datasets or a large budget for manually labeling data is available. In this work, we show that by combining deep learning with active learn...
متن کاملHybrid Approach for Named Entity Recognition
This paper proposes the Named Entity Recognition (NER) system for Punjabi language using a hybrid approach in which rule based approach and machine learning approach i.e. Hidden Markov Model (HMM) is combined. With no Dataset available, the Named Entities (NEs) were manually tagged which led us to the creation of training and testing dataset, under the linguistic supervision. Using hybrid appro...
متن کاملA Hybrid Learning for Named Entity Recognition Systems
This paper presents a hybrid method using machine learning approach for Named Entity Recognition (NER). A system built based on this method is able to achieve reasonable performance with minimal training data and gazetteers. The hybrid machine learning approach differs from previous machine learning-based systems in that it uses Maximum Entropy Model (MEM) and Hidden Markov Model (HMM) successi...
متن کاملA Hybrid Approach for Named Entity Recognition in Indian Languages
In this paper we describe a hybrid system that applies maximum entropy model (MaxEnt), language specific rules and gazetteers to the task of named entity recognition (NER) in Indian languages designed for the IJCNLP NERSSEAL shared task. Starting with named entity (NE) annotated corpora and a set of features we first build a baseline NER system. Then some language specific rules are added to th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Knowledge Based Systems
سال: 2021
ISSN: ['1872-7409', '0950-7051']
DOI: https://doi.org/10.1016/j.knosys.2021.106958